Query Optimization over Cloud Data Market
نویسندگان
چکیده
Data market is an emerging type of cloud service that enables a data owner to sell their data sets in a public cloud. Buyers who are interested in a certain dataset can access the data in the market via a RESTful API. Accessing data in the data market may not be free. For example, it costs USD 12 per month to obtain 100 “transactions” from the WorldWide Historical Weather dataset in Windows Azure Data Marketplace, where a transaction is a unit of result size (e.g., a query result of 4400 records would consume 44 transactions as Windows Azure Data Marketplace confines one transaction to 100 records). Therefore, in this paper, we present PayLess, a system that helps data buyers to optimize their queries so that they can obtain the query results by paying less to the data sellers. Experiments over synthetic data and real data sets in Windows Azure Marketplace show that PayLess can cost-effectively handle SQL query processing over data markets.
منابع مشابه
Ant Colony Optimization Technique for Secure Various Data Retrieval in Cloud Computing
Data retrieval is the largest task in any large database, in the world’s largest data bases like cloud data retrieval is the one of the major issue. Retrieving the data and processing the query over cloud server is very difficult. Many searching technique are used for retrieving the data from cloud servers. It can be retrieved through an optimization technique. There are many data retrieval tec...
متن کاملRelational Databases Query Optimization using Hybrid Evolutionary Algorithm
Optimizing the database queries is one of hard research problems. Exhaustive search techniques like dynamic programming is suitable for queries with a few relations, but by increasing the number of relations in query, much use of memory and processing is needed, and the use of these methods is not suitable, so we have to use random and evolutionary methods. The use of evolutionary methods, beca...
متن کاملInteractive Multi-Objective Query Optimization in Mobile-Cloud Database Environments based on a Weighted Sum Model
Multiple cost objectives such as monetary cost, query execution time and mobile device energy consumption have to be considered for query optimization in mobile-cloud database environments where multiple users on mobile devices request services executed on a cloud. Requested data might be partially cached on the mobile device itself or has to be processed on the cloud which leads to those vario...
متن کاملCommunication-Aware Traffic Stream Optimization for Virtual Machine Placement in Cloud Datacenters with VL2 Topology
By pervasiveness of cloud computing, a colossal amount of applications from gigantic organizations increasingly tend to rely on cloud services. These demands caused a great number of applications in form of couple of virtual machines (VMs) requests to be executed on data centers’ servers. Some of applications are as big as not possible to be processed upon a single VM. Also, there exists severa...
متن کاملRecurring Job Optimization for Massively Distributed Query Processing
Companies providing cloud-scale data services have increasing needs to store and analyze massive data sets. For cost and performance reasons, processing is typically done on large clusters of tens of thousands of commodity machines. Developers use high-level scripting languages that simplify understanding various system trade-offs, but introduce new challenges for query optimization. One key op...
متن کامل